BTW, in case anyone wants to kick the tires and test their Japanese (日本語), I have our Shisa V2 405B model up and running temporarily: https://chat.shisa.ai/
8b-class-japanese-models
shisa-ai/shisa-v2-qwen2.5-7b Text Generation • Updated Apr 16 • 90 • 5
shisa-ai/shisa-v2-llama3.1-8b Text Generation • Updated Apr 16 • 57 • 1
shisa-ai/shisa-v2-llama3.1-8b-preview Updated Apr 15 • 3
sbintuitions/sarashina2.2-3b-instruct-v0.1 Text Generation • Updated Mar 5 • 4.86k • 22
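For anyone who prefers to kick the tires locally rather than through the chat link above, here is a minimal sketch of loading one of the listed models with the standard Hugging Face transformers workflow. The model ID is taken from the collection above; the Japanese prompt and generation settings are purely illustrative assumptions, not a recommended configuration.

```python
# Minimal sketch (illustrative, not an official recipe): load a listed
# 8B-class Japanese model and run one chat-style generation.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "shisa-ai/shisa-v2-llama3.1-8b"  # from the collection above
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# Example Japanese prompt: "Please briefly explain machine learning."
messages = [{"role": "user", "content": "機械学習について簡単に説明してください。"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```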
speed
LLM in a flash: Efficient Large Language Model Inference with Limited Memory Paper • 2312.11514 • Published Dec 12, 2023 • 257
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU Paper • 2312.12456 • Published Dec 16, 2023 • 44
Accelerating LLM Inference with Staged Speculative Decoding Paper • 2308.04623 • Published Aug 8, 2023 • 25
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale Paper • 2208.07339 • Published Aug 15, 2022 • 5
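The last entry in this collection, LLM.int8(), corresponds to the 8-bit loading path exposed in transformers via bitsandbytes. A minimal sketch, assuming bitsandbytes is installed and reusing a model ID from the collection further up; everything beyond load_in_8bit is left at library defaults.

```python
# Minimal sketch (illustrative): 8-bit weight loading in the spirit of the
# LLM.int8() paper listed above, via transformers + bitsandbytes.
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

model = AutoModelForCausalLM.from_pretrained(
    "shisa-ai/shisa-v2-llama3.1-8b",  # reused from the collection above
    quantization_config=BitsAndBytesConfig(load_in_8bit=True),
    device_map="auto",
)
# Rough sanity check: int8 weights should take roughly half the memory of fp16.
print(model.get_memory_footprint())
```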